Generating a 3D shape template of a moving and deforming object from an RGB-D image sequence
نویسندگان
چکیده
Automatically reconstructing a 3D shape model of a nonrigid object using a sequence from a single commodity RGB-D sensor is a challenging problem. Some techniques use a 3D shape template of a target object; however, in order to generate the template automatically, the target object required to be stationary. Otherwise, a non-rigid ICP algorithm, which registers a pair of point clouds, can be used for reconstructing 3D geometry of a non-rigid object directly, but it often fails due to the ambiguity in point correspondences. This paper presents a method for generating a 3D shape template from a single RGB-D sequence. In order to reduce the ambiguity in point correspondences, our method leverages point trajectories obtained in the RGB images, which can be used for associating points in different point clouds. We demonstrate the capability of our method using deforming human bodies. Introduction Recently, various applications that present a moving and deforming object to users, such as virtual fitting room [1] and virtual pets [2], have become available to ordinary users. These applications render objects based on 3D geometry of their entire shapes (which we refer to as full-body shape models) and their non-rigid motion, both of which are usually handcrafted. Automatic techniques for reconstructing full-body shape models at each frame can drastically reduce the cost for creating a 3D shape model and motion (e.g., [3, 4]). They use multiple sensors (e.g., RGB or RGB-D sensors), whose relative poses are known, to capture the object from different viewpoints simultaneously. It then applies an existing 3D reconstruction technique for rigid objects, such as [5, 6, 7]. However, the use of multiple sensors may be still cumbersome for some applications in which ordinary users need their own shape models and motions. Reconstructing 3D shape and motion from a single sensor is a challenging problem. Two approaches have been proposed: one uses 3D shape templates of the target object and the other does not. Former approach [8, 9] generates a 3D shape template using a 3D shape reconstruction technique for rigid objects [5, 6, 7], assuming the object is almost stationary. They then fit the 3D shape template to a 3D point cloud at each frame of a single depth map sequence. One major limitation of this approach is that it requires an extra burden to capture the target object while it is stationary, which is practically infeasible, especially for objects like animals. Latter approach [10, 11] registers 3D point clouds in all frame of a single depth map sequence to any other frames using non-rigid iterative closest point (ICP) [12, 13]; however, it often 3D shape template
منابع مشابه
3D shape template generation from RGB-D images capturing a moving and deforming object
Automatically reconstructing a 3D shape model of a nonrigid object using a sequence from a single commodity RGB-D sensor is a challenging problem. Some techniques use a 3D shape template of a target object; however, in order to generate the template automatically, the target object required to be stationary. Otherwise, a non-rigid ICP algorithm, which registers a pair of point clouds, can be us...
متن کاملمدلسازی صفحهای محیطهای داخلی با استفاده از تصاویر RGB-D
In robotic applications and especially 3D map generation of indoor environments, analyzing RGB-D images have become a key problem. The mapping problem is one of the most important problems in creating autonomous mobile robots. Autonomous mobile robots are used in mine excavation, rescue missions in collapsed buildings and even planets’ exploration. Furthermore, indoor mapping is beneficial in f...
متن کاملSegmentation Assisted Object Distinction for Direct Volume Rendering
Ray Casting is a direct volume rendering technique for visualizing 3D arrays of sampled data. It has vital applications in medical and biological imaging. Nevertheless, it is inherently open to cluttered classification results. It suffers from overlapping transfer function values and lacks a sufficiently powerful voxel parsing mechanism for object distinction. In this work, we are proposing an ...
متن کاملE cient Reconstruction of Non-rigid Shape and Motion from Real-Time 3D Scanner Data
We present a new technique for reconstructing a single shape and its non-rigid motion from 3D scanning data. Our algorithm takes a set of time-varying unstructured sample points that show partial views of a deforming object as input and reconstructs a single shape and a deformation eld that t the data. This representation yields dense correspondences for the whole sequence, as well as a complet...
متن کاملPixel2Mesh: Generating 3D Mesh Models from Single RGB Images
We propose an end-to-end deep learning architecture that produces a 3D shape in triangular mesh from a single color image. Limited by the nature of deep neural network, previous methods usually represent a 3D shape in volume or point cloud, and it is non-trivial to convert them to the more ready-to-use mesh model. Unlike the existing methods, our network represents 3D mesh in a graph-based conv...
متن کامل